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Abstract 

We present a correction method for the density of states (DOS) obtained from the generalized 
ensemble simulations. The DOS is proportionally corrected to match the exact values and/or good 
approximations known for the system. We demonstrate the validity of the method by applying 
it to the DOS of 2D Potts model calculated from various generalized ensemble simulations. It is 
shown that the root-mean-square error of the DOS is reduced by ~ 50% or more without additional 
heavy calculations. 
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Generalized ensemble Monte Carlo (MC) algorithms, such as the multicanonical method 
PP, the Wang-Landau method [2], and variations thereof [SHTj, are effective simulational tools 
for probing complex systems that are often characterized by their rough energy landscape. 
Unlike the traditional Metropolis method that simulates canonical ensembles at distinct 
temperatures, these methods simulate a flat histogram ensemble to calculate the density of 
states (DOS) of a wide energy range. Previously, efforts have been mainly focused on opti- 
mizing the performance of the simulation: either to change the dynamics [3 [9] or to change 
the simulated statistical ensemble [Uj of the simulation. We show here a different approach 
to achieve higher accuracy and efficiency. The DOS could be proportionally corrected to 
match the exact values and/or good approximations known for the system to improve the 
overall accuracy without heavy calculations. The validity of the method is demonstrated by 
applying it to the DOS of 2D Potts model calculated from multicanonical and 1/t WL [7j 
simulations. 

The errors in the absolute DOS for a Q=8 2D Potts model of lattice size N = 8 x 8 
(hereafter "the Potts model") calculated from a 1/t WL simulation of t = 10 7 MC moves 
(flips) is shown in Fig. 1 as the solid curve. Since the simulation provides only a relative DOS 
for different energies, the absolute DOS is extracted using the fact that the number of ground 
stats (where E\ = —2N) is Q and is hereafter referred to as DOS for simplicity. Throughout 
this paper, we use the absolute error defined as e(E) = S(E) — S e (E) = \n[g(E)]—\n[g e (E)} in 
comparing the simulational (g(E)) and the exact (g e (E)) DOS values. Here the "exact" DOS 
(S e (E) = \n[g e (E)]) is calculated from a long run of 1/t Wang-Landau simulation of t — 10 12 
flips and we have tested that our discussion below is not affected by the errors remained 
in S e (E). One may note that the curve differs from a white noise that has no correlation 
between different values. Actually, a correlation between the errors is explicitly revealed in 
Fig. 2 where the correlation coefficient calculated as R e (AE) = J2e £(E)e(E+AE) is plotted 
as a function of the energy difference (AE). For calculating the correlation coefficient, the 
DOS is normalized in a way that the average of the errors is zero. This correlation between 
the errors suggests a possibility to correct the whole DOS using the knowledge about certain 
errors. 

For many systems, more than one exact DOS values and/ or good approximations 
are known. Actually, besides the rescaling condition used above, the DOS of the Potts 
model could also be normalized using the fact that the total number of possible states 
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is Y,E9n(E) = Q N 0. While the first rescaling condition (ln[g n {E)] = ln[g(E)] - 
\n[g(Ei)] + ln(Q)) guarantees the accuracy of the DOS at low energy levels, the second 
one (ln[g n (.E)] = ln[g(i?)] — lnf^^ g(E)} + A^ln[Q]) ensures the accuracy of the maximum 
DOS values as the summation Y^e9(E) is dominated by those values. (We denote the 
maximum DOS value as g(E n ).) The discrepancy between these two rescaling conditions, 
e(E n ) - e{E x ) = (N\n[Q] - ln[£ B #(£)]) - (ln(Q) - ln[#(£i)]), gives the absolute error in 
the maximum DOS values. With the knowledge of the error e(Ei) and e(E n ), we propose 
a linear correction to all the DOS values S(Ei) ( i = l,2,3...n) within the energy range 
[Ei, E n ], 

S c (Ei) = S(Ei) + e(E x ) + [e(E n ) - e{E x )]/(n - 1) x (i - 1) (1) 

We are goingto show that this simple correction that stretches or compresses the DOS curve 
proportionally to match the exact values improves the overall accuracy of the DOS. 

Shown in Fig. 3 are the root-mean-square (rms) error in both the corrected and non- 
corrected DOS of the Potts model as a function of simulation time. Here the rms error 
calculated as &(E) = ^ ^^^^ Se( - El ^ ~ j s a benchmark of the errors in the n DOS values 
within the energy range of [Ei,E n ]. To put the comparison on the same base, the non- 
corrected DOS is normalized using the same e(Ei). We find that in average the correction 
reduces the rms error by 50%, independent to the simulation time and model size. In other 
words, this correction could save 75% of the simulation time to achieve the same overall ac- 
curacy, given that a oc 1/ \fi. Obviously, the correction could be extended straightforwardly 
to the case that even more exact DOS values are known. As a simple test, for the DOS of a 
2D Ising model of various size, we achieve a reduction in rms error by 65% by applying the 
correction to both halves of the DOS around the peak. 

To reduce the error caused by the local fluctuations in the simulated DOS and make s(E) 
more representative of the overall error that we should correct, the average of the errors in 
several neighboring g(E) could be used. As in the case above, the error calculated from 
e(E n ) = N \n[Q] — ln[J]£ 9{E)] automatically makes use of the several maximum DOS values 
that are significant to determine the total number of possible states and is thus a better 
choice than that simply calculated as e(E n ) = g e (E n ) —g(E n ). We remark that g e (E n ) could 
be calculated to sufficient accuracy using a Metropolis simulation of the system at infinite 
temperature with slight calculations, and could be used together with g e (Ei) to make a 
better initial guess for the simulations.) Also, it is trivial to find out for the Potts model the 
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g(E) of the first, second, and third excited energy levels, g(E 2 = —2N + 4) = NQ(Q — 1), 
g(E 3 = -2N + 6) = 2NQ(Q - 1), and g(E 4 = -2N + 7) = 2NQ(Q - 1)(Q - 2). we thus 
use e(Ei) = [e(Ei) + e(E 2 ) + £(£3) + e(E 4 )}/4 in our correction. 

A more direct test of the validity of the correction might be comparing the accuracy of the 
thermaldynamic values calculated from the corrected and non-corrected DOS. For the Potts 
model, we show in Fig. 4 the histogram of the temperatures of the peak of the specific heat 
(T c ) obtained from 100,000 realizations of malticanonical and 1/t WL simulation. For both 
simulations the correction is more effective than doubling the simulation time. The gain in 
the efficiency for calculating T c , however, is not so large as that we find for cr(E). This is 
because the correction that improves the overall accuracy of the DOS is more effective to the 
value that depends on "long-range" accuracy of the DOS. And note that while the cr(E) is 
calculated for the entire corrected DOS curve, the critical temperature is only determined by 
about half of the curve [12J. As noted before [13], the DOS resulted from 1/t WL simulation 
has systematic errors that cause the deviation of the center of the distribution from the 
exact value. For this case, not only the correction sharpen the distribution but also shift the 
center toward the correct value significantly, showing that the correction is also effective to 
reduce the systematic errors in the DOS. We want to point out that the better results from 
the multicanonical simulations is because it uses a constant weighting gi(E) obtained from 
a 1/t WL simulation using 2 6 flips, while the 1/t WL simulations start from a initial guess 

g(E) = 0. 

In summary, we propose a correction method for the DOS calculated from generalized 
ensemble simulation. The DOS could be corrected to match the exact values and/or good 
approximations known for the system to improve the overall accuracy. Applying the method 
to 2D Potts model and Ising model, we achieve a reduction in the rms error by 50% and 
65%, respectively. Comparing to the simulation effort, this correction hardly costs any time. 
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FIG. 1: Errors in a simulated DOS of the 8x8 Q=8 Potts model as a function of energy per lattice 
site, E/N (solid curve). The DOS is calculated by a 1/t WL simulation using a single run of 10 7 
MC moves and is normalized using the fact that the number of ground states is Q. The errors 
after correcting the straight line connecting the two error points {E/N = — 2, —0.25 and marked 
as circles) whose values are known is shown as the dashed curve within the energy range [-2,-0.25]. 
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FIG. 2: Correlation coefficient of the errors in the DOS of the Potts model as a function of energy 
difference. The coefficient is calculated using 10,000 DOS obtained from independent 1/t WL 
simulations of t = 10 7 flips 
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FIG. 3: The rms error of the DOS of the Potts model as a function of multicanonical simulation 
time. Each curve is the average of 100 realizations. 
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FIG. 4: Left panel: Histograms of the location of the peak of the specific heat for the Potts model 
resulted from 100,000 realizations of multicanonical simulations. The solid curve is calculated 
from the non-corrected DOS obtained by simulations using 2 x 10 6 flips. The dashed and dotted 
curve (almost overlapping with each other) correspond to applying the correction and tripling the 
simulation time to 6 x 10 6 flips, respectively. Right panel: Same as the left panel but for 1/t WL 
simulations. The dotted curve correspond to doubling the simulation time to 4 x 10 6 flips. See 
text for more details. 
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